Amendments to the Claims 

1 . (Currently Amended) A method for processing video comprising: 
receiving a video signal; 

receiving a first audio signal containing annotations , wherein each annotation is 
preceded by a keyword to specify a type of that annotation ; 

receiving a second audio signal containing environmental sounds corresponding 
to the video signal; and 

converting the annotations into searchable annotations organized as hierarchical 
shot clusters using a voice-to-text conversion system. 

2. (Previously Presented) The method of claim 1 further comprising removing the 
annotations from the second audio signal. 

3. (Previously Presented) The method of claim 2 wherein removing the annotation 
from the second audio signal further comprises utilizing a least-mean-square 
algorithm. 



4. (Previously Presented) The method of claim 1 further comprising: 
generating a center text title via the searchable annotations; and 
generating a scrolling text banner via the searchable annotations. 



5. (Previously Presented) The method of claim 1 further comprising generating a 
video abstract via the first and second audio signals, the video signal and the 
searchable armotations. 
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10. (Currently Amended) A system for processing video comprising: 
a display device; 

a video signal displayed on the display device; 

a first audio signal containing annotations , wherein each annotation is preceded 

by a keyword to specify a type of that annotation ; 
a second audio signal containing environmental sounds corresponding to the 

video signal; and 

a voice-to-text conversion system that converts the annotations into searchable 
annotations organized as hierarchical shot clusters. 

1 1 . (Previously Presented) The system of claim 10 further comprising a processor to 
remove the annotations from the second audio signal. 

12. (Previously Presented) The system of claim 1 1 wherein the processor is further 
to: 

generate a center text title with the computer searchable annotations; and 
generate a scrolling text banner with the computer searchable annotations. 

13. (Previously Presented) The system of claim 1 1 wherein the processor is further to 
generate a video abstract via the first and second audio signals, the video signal 
and the searchable annotations. 

14. (Previously Presented) The system of claim 10 wherein the video signal is 
received from a video recorder. 
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15. (Previously Presented) The system of claim 10 wherein the first and second audio 
signals are received from at least one microphone. 

16. (Currently Amended) A machine-readable medium having data stored thereon 
representing sets of instructions which, when executed by a machine, cause the 
machine to: 

receive a video signal; 

receive a first audio signal containing annotations , wherein each 

annotation is preceded by a keyword to specify a type of that annotation : 
receive a second audio signal containing environmental sounds corresponding to 

the video signal; and 
convert the annotations into searchable annotations organized as hierarchical shot 

clusters using a voice-to-text conversion system. 

17. (Previously Presented) The machine-readable medium of claim 16 wherein the 
sets of instructions, when executed by the machine, further cause the machine to 
remove the annotations from the second audio signal. 

18. (Previously Presented) The machine-readable medium of claim 16 wherein the 
sets of instructions, when executed by the machine, ftirther cause the machine to: 
generate a center text title via the searchable annotations; and 

generate a scrolling text banner via the searchable annotations. 
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19. (Previously Presented) The machine-readable medium of claim 16 wherein the 
sets of instructions, when executed by the machine, further cause the machine to 
generate a video abstract via the first and second audio signals, the video signal 
and the searchable annotations. 

20. (Currently Amended) An apparatus comprising: 
an analog to digital (AA^) converter; and 

a processor coupled to the AA^ converter, the processor to 
receive a video signal, 

receive a first audio signal containing annotations , wherein each annotation is 
preceded by a keyword to specify^ a type of that annotation , 

receive a second audio signal containing environmental sounds corresponding to 
the video signal, and 

convert the annotations into searchable annotations organized as hierarchical shot 
clusters using a voice-to-text conversion system. 

21 . (Previously Presented) The apparatus of claim 20 wherein the processor is further 
to remove the annotations from the second audio signal. 

22. (Previously Presented) The apparatus of claim 20 wherein the processor is further 
to: 

generate a center text title with the computer searchable annotations; and 
generate a scrolling text banner with the computer searchable annotations. 
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23. (Previously Presented) The apparatus of claim 20 wherein the video signal is 
received from a video recorder. 
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